STARS: Spatial-Temporal Active Re-sampling for Label-Efficient Learning from Noisy Annotations

نویسندگان

چکیده

Active learning (AL) aims to sample the most informative data instances for labeling, which makes model fitting efficient while significantly reducing annotation cost. However, existing AL models make a strong assumption that annotated are always assigned correct labels, may not hold true in many practical settings. In this paper, we develop theoretical framework formally analyze impact of noisy annotations and show systematically re-sampling guarantees reduce noise rate, can lead improved generalization capability. More importantly, demonstrates key benefit conducting active on label-efficient learning, is critical AL. The results also suggest essential properties an function with fast convergence speed guaranteed error reduction. This inspires us design novel spatial-temporal by leveraging important spatial temporal maximum-margin classifiers. Extensive experiments conducted both synthetic real-world clearly demonstrate effectiveness proposed function.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Learning from Noisy Label Distributions

In this paper, we consider a novel machine learning problem, that is, learning a classifier from noisy label distributions. In this problem, each instance with a feature vector belongs to at least one group. Then, instead of the true label of each instance, we observe the label distribution of the instances associated with a group, where the label distribution is distorted by an unknown noise. ...

متن کامل

Online Active Learning Methods for Fast Label-Efficient Spam Filtering

Active learning methods seek to reduce the number of labeled examples needed to train an effective classifier, and have natural appeal in spam filtering applications where trustworthy labels for messages may be costly to acquire. Past investigations of active learning in spam filtering have focused on the pool-based scenario, where there is assumed to be a large, unlabeled data set and the goal...

متن کامل

On Re nement and Temporal Annotations ?

This paper introduces the semantics of a wide spectrum language with a rich compositional structure that is able to represent both temporal speciications and sequential programs. A key feature of the language is the ability to represent partial correctness annotations expressed in temporal logic. A reenement relation is presented that enables reenement steps to make use of these partial correct...

متن کامل

Multi-Task Active Learning for Linguistic Annotations

We extend the classical single-task active learning (AL) approach. In the multi-task active learning (MTAL) paradigm, we select examples for several annotation tasks rather than for a single one as usually done in the context of AL. We introduce two MTAL metaprotocols, alternating selection and rank combination, and propose a method to implement them in practice. We experiment with a twotask an...

متن کامل

Multi-Label Active Learning from Crowds

Multi-label active learning is a hot topic in reducing the label cost by optimally choosing the most valuable instance to query its label from an oracle. In this paper, we consider the poolbased multi-label active learning under the crowdsourcing setting, where during the active query process, instead of resorting to a high cost oracle for the ground-truth, multiple low cost imperfect annotator...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2023

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v37i9.26301